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Abstract 

One of the most challenging problems in complex dynamical systems is to control complex 
networks. In previous frameworks based on the structural or the exact controllability theories, the 
ability to steer a complex network toward any desired state is measured by the minimum number 
of required driver nodes. However, if we implement actual control by imposing input signals on 
the minimum set of driver nodes as determined, e.g., by the structural controllability theory, an 
unexpected phenomenon arises: the energy required to approach a target state with reasonable 
precision is often unbearably large, precluding us from achieving actual control, i.e., the designated 
state can not be reached in effect, especially for networks with a small number of drivers. In 
particular, the energy of controlling a set of networks with similar structural properties follows 
a fat-tail distribution, indicating the existence of networks with practically divergent energy. We 
aim to reconcile the paradox of controlling complex networks: optimal structural controllability 
versus unrealistic energy required for control. We identify fundamental structural “short boards” 
in complex networks that play a dominant role in the enormous energy, and offer a theoretical 
interpretation for the fat-tail energy distribution and simple strategies to significantly reduce the 
energy by imposing slightly augmented set of input signals on properly chosen nodes. Our findings 
indicate that, although full control can be guaranteed by the prevailing structural controllability 
theory, it is necessary to balance the number of driver nodes and the control energy to achieve 
actual control, and our results provide a framework to address this outstanding issue. 

Notes on the submission history of this work: This work started in late 2012. The phe¬ 
nomena of power-law energy scaling and energy divergence with a single controller were discovered 
in 2013. Strategies to reduce and optimize control energy was articulated and tested in Spring 
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I. INTRODUCTION 


The past fifteen years have witnessed tremendons advances in onr nnderstanding of com¬ 
plex networked strnctnres in various natural, social, and technological systems, as well as 
the dynamical processes taking place on themflMl^. The significant issue of control arises 
naturally, but this remains to be outstanding and extremely challenging, since nonlinear dy¬ 
namical processes generally take place on complex networks. Control of nonlinear dynamics, 
especially when chaos is present, can be done but only for low-dimensional systems (l^. 18 |. 
Despite the development of nonlinear control methods dKi in certain particular situations 
such as consensus 25|, communication j^, 27|, traffic ^ and device networks jl^, [^, a 
general framework of controlling complex nonlinear-dynamical networks has yet to be de¬ 
veloped. A natural approach is to reduce the problem to controlling complex networks with 
linear dynamics based on traditional frameworks from control engineering j^-34 . 


In the past a few years, great progress was made toward understanding the linear control¬ 
lability of complex networks in terms of the fundamental issue of the minimum number of 
driver nodes required to steer the whole network system from an arbitrarily initial state to 


an arbitrarily final state in finite time j35l-l41 


In particular, Liu et ah successfully adopted 
the classic structural controllability theory developed by Lin to com ple x networks of 


various top olog ies j36| . for which the traditional Kalman’s rank condition |30| is difficult to 
be applied [3^. The ground breaking results show that, the structural controllability of a 
directed network can be assessed by using the maximum matching j42-44| algorithm. The 
effects of the density of in/out degree nodes were incorporated into the structural control¬ 


lability framework which has also been applied to protein interaction networks 


Recently, based on the classic Popov-Belevitch-Hautus (PBH) rank condition in tra¬ 


ditional control engineering, a variant of the structural-controllability theory, the so-called 
exact controllability framework, was developed (i^ . 


For both the structural- and exact-controllability frameworks, the aim is to determine 
the minimum number of driver nodes. Ad, for networks of various topologies. However, we 
have encountered unexpected difficulties in carrying out actual control of complex networks 
by using the minimum set of driver nodes as determined by the controllability frameworks. 
This concerns effectively the issue of guiding the network system to approach a final state 
with acceptable proximity error. In particular, given an arbitrary complex network, once 
Ad is determined, we can calculate the specific control signals by using the standard linear 
systems theory and apply them at various unmatched nodes. A surprising finding is 
that, quite often, the actual control of the system is difficult to be achieved computationally 
in the sense that in any finite time, it is not possible to drive the system from an arbitrary 
initial state to an arbitrary final state, i.e., the actual state the system finally reaches is 
unreasonably far from the designated one. This difficulty in realizing actual control, which 
has not been formerly addressed in any other works, persists for a large number of model 
and real-world networks, prompting us to study if the developed controllability frameworks 
can ensure actual control with given finite computational precision and, more importantly, 
to consider the issue of control energy. 


In this paper, we investigate the issue of control energy in the framework of structural 
controllability theory. We find that, the energy required to steer a system from a specific 
initial state to a target state in finite time follows a fat-tail distribution, indicating the 


2 
































existence of extraordinarily high energy requirement. In extreme but not uncommon cases, 
the energy is practically divergent. This phenomenon signihes the emergence of a paradox in 
controlling complex networks: although a small fraction of driver nodes can guarantee full 
control of the network system mathematically, the energy required to achieve control is often 
unbearable. We resolve the paradox by presenting the idea of control chains, in which the 
fat-tail distribution of the energy can be derived as a key structural feature. The theory of 
control chains enables us to offer simple strategies to signihcantly reduce the control energy 
through small augmentation of the number of control signals beyond Nd. In this regard, 
the quantity A^d, on which the structural controllability theories focus, can effectively be 
regarded as the lower bound of the actual number of control signals required. To realize 
actual control of a complex network, it is imperative to hnd the trade-off between the number 
of external input si gnal s and feasible energy consumption. 

Remark. In Ref. ^ , partial theoretical bounds for the control energy were derived. The 
bounds are partial because, for example, for networks whose lower bounds can be obtained, 
the upper bounds typically diverge. This property of divergence was puzzling: does it mean 
that the actual energy required would diverge as well and, if so, can a complex network actu¬ 
ally be controlled? The present work was largely motivated by these questions, in which we 
obtain a detailed understanding of the physically important issue of practical controllability 
of complex networks through the discovery of a general scaling law for the distribution of 
the energy required for control. The existence of control chain is also uncovered, enabling 
us to articulate practical strategies to signihcantly reduce the control energy. 


II. CONTROL FORMULATION AND IMPLEMENTATION 


Optimal control energy and Gramian matrix. To calculate the optimal energy required 
to control a complex network in the framework of structural controllability, we consider the 


standard setting of linear dynamical systems under control input [33, [33, [48 


X = Ax -|- Ru, (1) 

where x = [a;i(f),... ,xjv(t)]^ is the state variable of the whole network system, the vector 
u = [ui(t),... ,UM(t)]'^ is the control input or the set of control signals, A = {a^} is the 
N X N adjacency matrix of the network, and B = {bik} is the N x Nd control matrix 
specifying the set of “driver” nodes [^, each receiving a control signal (corresponding to 
one component of the control vector u). The minimum number Nj) of driver nodes to fully 
control a network is determined through the set of maximum matching paths j^. A node 
is chosen to be the driver node if it is the starting point of a maximum matching path. The 
system is fully controlled only if each and every node is either a driver node or being driven 
along a maximum matching path. Optimal control of a linear network in the sense that the 
energy is minimized can be achieved when the input control signals are chosen as j^: 

ut = B^- • W-^ ■ (xt, - ■ xo), (2) 


where ^ 

IV= r e^^B- B^ -e^^^dr 

Jto 


( 3 ) 
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is the Gramian matrix, a positive-definite and symmetric matrix j^, which is the base to 
determine, quantitatively, if a system is actually controllable. In particular, the system is 
controllable only when W is nonsingular (invertible) (i^. . 

Given matrices A and B, the initial and the final (target) states of the system as^wll as 
the control time tf, the control vector u can be determined in a standard manner 
the Gramian matrix W. The energy required through the control input u is given by 



rk 

E{ti) = / 

Jo 


(4) 


where control is initiated at t = 0. 

Numerical implementation of control. We use the Erdos-Renyi (ER) type of directed 

and the Barabasi-Albert (BA) type of directed scale-free net- 


51, 52 


random networks 

works [l^ with a single parameter Pb- Specifically, for a pair of nodes i and j with a link, 
the probability that it points from the smaller-degree to the larger-degree nodes is Pb, and 
1 — Pb is the probability that the link points in the opposite direction (if both nodes have 
the same degree, the link direction is chosen randomly). (See Appendix A for analytical 
treatment of the in- and out-degree distributions.) To determine the set of driver nodes, 
we use the maximum-matching algorithm 3l|, which gives the control matrix B. For each 
combination of A and B, we first randomly choose the initial and final states. We then 
calculate the corresponding Gramian matrix W [Eq. ([2])], the input signal Uf [Eq. ([2])], the 
actual final states [Eq. ([T])], and finally the control energy P(tf) [Eq. (jl])]. Repeating 
this process for each and every independent network realization in the ensemble entails an 
extensive statistical analysis of the control process. 


III. RESOLUTION OF CONTROL PARADOX AND CONTROL-ENERGY DIS¬ 
TRIBUTION 

A. Quantification of control energy and resolntion of control paradox 


Mathematically, if the Gramian matrix W is singular, the energy diverges. Through ex¬ 
tensive and systematic numerical computations, we find that, even when W is non-singular 
in the mathematical sense, for typical complex networks its condition number can be enor¬ 
mously large making it effectively singular as any physical measurement or actual 
computation must be associated with a finite precision. Say in a physical experiment the 
precision of measurement is e. In a computational implementation of control, e can be 
regarded as the computer round-off error. Gonsider the solution vector X of the linear 
equation: W ■ X = Y, where Y is a known vector. Let Cw be the condition number of 
W. The accuracy of the numerical solution of X, denoted by ex = 10“^ {k is a positive 
integer), is bounded by the product between Cw and e [5^. We see that, if Cw is larger 
than 10“^/e = Cwi h is not possible to bring the system to within 10“^ of the final state, 
so control cannot be achieved in finite time. 

For a large number of networks drawn from an ensemble of networks with a pre-defined 
topology, the condition numbers of their Gramian matrices are often orders of magnitude 
larger than Cw (see Fig. Al in Appendix B1 for the relation between Cw and ex)- For these 
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networks, not only is the control vector unable to drive the system to the target state, but the 
associated energy can be extremely large. These observations suggest the following criterion 
to dehne controllability in terms of the control energy: a network is controllable with respect 
to a specihc control setting if and only if the condition number of its Gramian matrix is less 
than Cwi a critical number determined by both the measurement or computational error 
and the required precision of control. Quantitatively, for a given set of network parameters 
(hence a given network ensemble) and control setting, the probability that the condition 
number of the Gramian matrix is less than Cw, P{Cw)i can effectively serve as a new type 
of controllability, which we name as practical controllability. Increasing the precision of 
the computation, e.g., by using special simulation packages with round-off error orders of 
magnitude smaller than that associated with the conventional double-precision computation, 
would convert a few uncontrollable cases into controllable ones, but vast majority of the 
uncontrollable cases remain unchanged. 

Figures [T](a-b) show the percentage of driver nodes = Nd/N versus the directional link 
probability Pb- We see that is minimized for Pb ~ 0.5, indicating that, mathematically, 
only a few control signals are needed to control the whole network, leading to optimal 
structural controllability. But can practical controllability be achieved in the same parameter 
regime where the structural controllability is optimized? 

Figure [T](c) show, for the same networks as in Fig. [I](a), the measure of control energy, 
i.e., the probability P{Cw), versus the network parameter Pb. We see that, for both regimes 
of small and large Pb values where the structural controllability is weak [corresponding to 
relatively high values of ud in Fig. [11(a)], the practical controllability is relatively strong. 
In the regime of small Pb values, most directed links in the network point from small- to 
large-degree nodes. In this case, the network is more practically controllable, in agreement 
with intuition. The surprising result is that, in the regime of intermediate Pb values (e.g., 
Pb around 0.5) where the number of driver nodes to control the whole network is minimized 
so that the structural controllability is regarded as strong, the practical controllability is 
in fact quite weak, as the probability of the condition number being small is close to zero. 
For example, for (k) = 4, the minimum value of P{Cw) is only about 0.1 for Pb ~ 0.6; for 
{k) = 6 and {k) = 8, the minimum values are essentially zero. A striking phenomenon is 
that the minimum value of P{Cw) occurs in a wide range of the parameter Pb, e.g., [0.3, 0.8] 
and [0.2,0.9] for (k) = 6 and (k) = 8, respectively. This indicates that the network is 
practically uncontrollable for most cases where the structurally controllability is deemed to 
be optimal. The same phenomenon holds for different network sizes (see Fig. A2 in Appendix 
B2). Another interesting hnding in Fig. [T]is that is symmetric about Pb = 0.5. However, 
the symmetry is broken for P(C'w), indicating that there is no simple negative correlation 
between and P{Cy^). This prompts us to hnd more essential structural properties 
responsible for the smallness of P{Cy/). 


B. Concept of control chains and distribution of control energy 

Suppose the network is practically controllable so that the required control energy is 
not unrealistically large. For an ensemble of randomly realized network conhgurations with 
the same structural properties and for different control settings, the control energy can be 
regarded as a random variable. What is then its probability distribution? To gain insights. 
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FIG. 1: Structural and practical controllability measures in directed networks. Structural 
controllability measure no versus directional edge probability Ph for (a) ER random networks and (b) BA 
scale-free networks of size N = 1000 and three values of the average degree {{k) = 4, 6, and 8). The 


dash-dotted lines represent the results obtained by the cavity method 36l|. a nd the squares, triangles, and 


circles are the simulation results from the maximum matching algorithm 36|. (c,d) Measure of practical 
controllability P{Cw) for ER random and BA scale-free networks of size N = 100, respectively, where 
PiCw) is the probability that the condition number of the Gramian matrix is less than some physically 
reasonable threshold value versus Ph- Comparing (a) with (c), or (b) with (d), we observe the striking 
phenomenon that, in the parameter regime where the number of driver nodes is minimized so that the 
corresponding networks are deemed to be most structurally controllable, they are practically 
uncontrollable. The phenomenon persists regardless of the network size and type. 


we generate directed networks with different values of (k) and Ph- We then implement 
the maximum matching algorithm to obtain the control matrix B and calculate the 
minimum energy by using Eq. (jl]) for final time tf. For each network, the initial states xq 
and desired final states are randomly chosen. The calculation of energy is done only 
for those networks with condition number smaller than Cw, and a variety of Cw values are 
adopted. Representative results are shown in Fig. [2l where an algebraic (power-law) scaling 
behavior with fat tails is observed for all cases with the scaling exponent approximately equal 
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FIG. 2: Distributions of control energy for practically controllable networks. (a,c) Energy 
distributions under different values of the threshold condition number Cw for Pb = 0.1 and t{ = 1. (k) = 6 
for random networks (a) and (k) = 8 for scale-free networks (c). (b,c) Energy distributions for different 
values of the average degree (k) and the directional connection probability Pb under Cw = 10^^ and t{ = 1 
for ER random and BA scale-free networks, respectively. In all cases, we observe an algebraic (power-law) 
scaling behavior. 


to 1.5. The scaling is robust against various Cw values [Figs. |21[a) and (c)] and network 
sizes (see Fig. A3 in Appendix B3). From Figs. [2](a) and (c), we see that different values of 
Cw result in different groups of practically controllable networks, and the required control 
energy in general increases with Cw- In Figs. |2][b) and (d), the value of Cw is fixed and the 
control energy required is larger for larger value of Pb ns compared with the case of Pb = 0. 
This is intuitively correct as, for Pb = 0, all directed links point from small- to large-degree 
nodes, facilitating control of the whole network. 

We develop a physical understanding of the large control energy required and also the 
algebraic scaling behavior in the energy distribution. To gain insights, we first consider a 
simple model: an unidirectional, one-dimensional (ID) string network, for which an analytic 
estimate of the control energy can be obtained (see Appendix C) as 

El « K], ( 5 ) 
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FIG. 3: Relationship among ID chain energy, the smallest eigenvalue of iJ-matrix, and 
network control energy, (a) For an ID chain network of length I, energy Ei and versus /, providing 
support for the analytic result Eq. ([S]). (b) Correlation between (E), the average of control energy for 
networks with the same LCC length, and E^, the energy of a LCC with length Dq = F (E = 3, 4, 5, 6, 
and 7 for ER and E = 3, 4, 5, and 6 for BA networks), calculated from ensembles of 10000 networks. 


where Ei denotes the energy required to control a ID string of length I (the number of nodes 
on the string) and Xhi is the smallest eigenvalue of the underlying hf-matrix, denoted by Hi, 
which is related to the Gramian matrix hy H = The condition number of the 

ID chain system increases exponentially with its length. For example, the value of Cw of a 
chain of length larger than 7 has already exceeded Cy/ = 10^^. This indicates that, even for 
a simple ID chain network, the energy required for control tends to increase exponentially 
with the chain length. Numerical verihcation of Eq. ([S]) is presented in Figs. El^a)- Although 
Eq. ([ 2 ]) is obtained for a simple ID chain network, we hnd numerically that it holds for 
random and scale-free network topologies (See Fig. A5 in Appendix D). 

The relation Eq. ([5]) and Fig. [3](a) provide an intuitive explanation for our Ending that 
applying control signals to the minimum set of driver nodes calculated from the structural 
controllability theory typically requires enormously large energies. Under the theory of struc¬ 


tural controllability, a network is deemed more structurally controllable if A^d is smaller 36 


However, as the number of driver nodes is reduced, the length of the chain of nodes that 
each controller drives on average must increase, leading to an exponential growth in the 
control energy. In the “optimal” case of structural controllability where A^d = 1 is achieved, 
the length of the control chain will be maximized, leading to unrealistically large control 
energy that prevents us from achieving actual control of the system. 

The idea of exploiting the length of control chain can also be used to explain the algebraic 
scaling behavior in the energy consumption. As discussed, identifying maximum matching 
so that the network is deemed structurally controllable is independent of the control energy. 
However, when maximum matching is found, we can divide the whole network into A^d 
control signal paths (CSPs), each being a unidirectional ID string led by a driver node 
that passes the control signal onto every node along the path, as illustrated by the vertical 









paths in Fig. HJ^a). CSPs thus provide a picture indicating how the signals from the Nd 
external control inputs reach every node in the network to ensure full control (in the sense 
of structural controllability). 




FIG. 4: Schematic illustration of various concepts to characterize and understand the 
practical controllability of a network, (a) Control-signal paths (CSPs) of a random network obtained 
from maximum matching in structural controllability theory, where a control signal enters a CSP via the 
corresponding driver node (yellow), the starting node of the path, and goes through each matched node 
(blue) along the path. In this example, the network has nu = 10 CSPs of different lengths. Non-path 
links, links that are irrelevant to matching, are displayed in green, (d) All possible LCCs in the network. 
Typically there are multiple LCCs of the same length. In this example, the length of the LCCs is 4, which 
is defined as the control diameter of the network. Two LCCs sharing no common nodes are marked by red 
nodes and solid red arrows. Links belonging to other LCCs are marked by red dashed arrows. CSPs are 
denoted using letters a to j from the left to the right. Each node is specified using its path number and its 
position along the path sequentially from top to bottom. For example, node el is the driver node of path 
e, and node h2 is the node right after the driver node on path h. The two LCCs with solid arrows are 
listed in bold. Eight LCCs in the network converge to only three end-nodes, e6, /5, and /6 (marked by red 
dashed circles), leading to LCC degeneracy to = 3. The control energy is determined by any randomly 
chosen to LCCs among all existent ones. 


We can distinguish two types of links: one along and another between the CSPs, as shown 
in Fig. 111(a). It may seem that the latter class are less important as the control signal and 
energy flow along the former set of links. However, due to coupling, a node’s dynamical 
state will affect all its nearest neighbors’ states which, in turn, will affect the states of their 
neighbors, so on, and vice versa. In principle, any driver node connects with nodes both 
along and outside its CSP. Correspondingly, an arbitrary node in the network is influenced 
by every driver node, directly through the CSP to which it belongs, or indirectly through 
the CSPs that it does not sit on. Intuitively, the ability of a driver node to influence a node 
becomes weaker as the distance between them is increased. In order to control a distant 
node, exponentially increased energy from the driver is needed. The chain starting from a 
driver node and ending at a non-driver node along their shortest path is effectively a control 
chain. We can define the length of the longest control chain (LCC), Dq, as the control 
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diameter oi the network, as shown in Fig. There can be multiple LCCs. The node 

at the end of a LCC is most difficult to be controlled in the sense that the largest amount 
of control energy is required. The number m of such end nodes dictates the degeneracy 
(multiplicity) of LCCs. An example is shown in Fig. 111(b), where we see that, although there 
can be multiple LCCs, the ends of them converge to only three nodes, leading to m = 3. 
Since the energy required to control a ID chain grows exponentially with its length in such 
a way that even one unit of increase in the length can amplify the energy by several orders 
of magnitude [Fig. [3](a)], the energy associated with any chain shorter than the LCC can 
typically be several orders of magnitude smaller than that with the LCC. Thus, the total 
energy is dominated by the LCCs. Due to the low value of typical m (see Fig. A6 in Appendix 
El), a single LCC essentially dictates the energy magnitude of the whole system. As shown 
in Fig. [3](b), actual network control energy shares strong positive correlation and similar 
magnitude to the LCC energy defined as the energy of a LCC of the corresponding 
network, especially for networks with long LCCs. Intuitively, the probability to form long 
LCCs is small. Accordingly, a longer LCC tends to have smaller value of degeneracy m. As 
a result, the longest LCCs have almost no degeneracy (m = 1) so that they effectively rule 
the control energy of the whole network (see Appendix El). 

The construction in Fig.|l]thus provides a structural profile to estimate the control energy. 
In particular, a network can be viewed as consisting of a set of structural elements, the control 
chains, interacting with each other via the links among them, and interactions among these 
basic structural elements usually play an important role in determining the properties of a 
physical system. Hence, the total energy E required has two components: Ei, the sum of 
energies associated with all control chains, and ^ 2 , the interaction energies among the chains. 
Observed from Fig. [n](b), E 2 is important for networks with short LCCs (higher m values). 
The energy scaling relation shown in Fig. [2] can be derived by devising appropriate models to 
analyze the contributions from the two components. We have developed two such models. 
The hrst is the LCC-skeleton model, which only takes Ei into account and provides an 
analytic estimate of the control energy distribution function as well as the scaling exponent. 
The second is the double-chain interaction model, in which a system consisting only two 
interacting control chains captures the key features of the entire network by characterizing 
the essential effect of interaction energy among the structural elements. These two models 
combined serve as a framework to determine the energy profile associated with controlling a 
complex networked system, providing a deep understanding of practical controllability (see 
Appendix E for details of the two models). 


IV. CONTROL OF REAL-WORLD NETWORKS 
A. Control of an electrical circuit network 

To further test the concept and framework of practical controllability, we consider a 
real one-dimensional cascade parallel R-C circuit network, as schematically illustrated in 
Fig. [5](a). The network can be represented by a bidirectional ID chain with self-loops for 
all the nodes, as shown in Fig. [5](b). The network size can be enlarged, say by one unit, 
by attaching an additional branch of resistor and capacitor at the right end of the circuit. 
The state Ui{t) of node i at time t is the voltage of capacitor i, and the input voltage u{t) 
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FIG. 5: Cascade parallel R-C circuit and its corresponding network presentation, (a) A 

cascade parallel R-C circuit with L = 7 resistors (i?i, i? 2 , ■ • and Rl, each of resistance Ifl) and 7 
capacitors {Ci, C 2 , ■ ■ ■, and Cl, each of capacitance IF). External voltage input u{t) is applied onto the 
left side of the circuit, and the voltage of capacitor Ci is Ui{t){l < i < L). (b) Network representation of 
the circuit in (a) as a bidirectional ID chain network of seven nodes, where the external voltage input u{t) 
is injected into node 1 (yellow driver node, the controller). The dynamical state of node i is described by 
the voltage on its capacitor, Ui(t). Links (blue) between nodes are bidirectional and have uniform weight 1 
in either direction. Each node has a self-link (red) of weight —2, except the ending node (node 7) whose 
self-link has weight —1. (c) The circuit network in (b) with an extra external current input ie{t) into the 
capacitor C 3 , where 13 and 14 denote the currents through resistors i?3 and i?4, respectively. In the absence 
of the extra current input, i^(t) — i 4 {t) is the current through the branch of < 73 . (d) The extra external 
current input ie(t) serves as a redundant control input injected into node 3 of the network in (b). Now 
there are two driver nodes (yellow) in the network, nodes 1 and 3. 


represents the control signal. The purpose of control is to drive the voltages of the capacitors 
from a set of values to another within time tf through the input voltage u{t). The control 
energy can then be calculated by Eq. (jl]). The actual energy dissipated in the circuit during 
the control process is given by 


^real = f U(t) ■ I(t)dt, (6) 

Jo 

where U(t) = u(t) and I{t) are the input voltage and current at time t, and Ereai is in 
units of Joule. By making the circuit equivalent to a ID chain network, we have three types 
of energy: the control energy of the actual circuit calculated from Eq. (j4]), the dissipated 
energy of the circuit from Eq. ([6]), and the control energy of the ID equivalent network. 
Figure [6](a) shows that the control energy and the dissipated energy of the circuit do not 
differ substantially from the energy calculated from unidirectional ID chain. Among the 
three types of energy, the energy cost associated with the control process, as calculated from 
Eq. ([6]), is maximal. 
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FIG. 6: Control energy and optimization for ID chain and cascade parallel R-C circnit. (a) 

Energy required for controlling a unidirectional chain (red) and the corresponding circuit (blue) as well as 
the dissipated energy of the circuit calculated from Eq. (I6|) versus the chain length L. (b) Control and 
dissipated energies in the presence of a redundant control signal to node z (i > 1), which breaks the chain 
into two subchains of lengths i and L — i, respectively. 


B. Strategies to balance control energy and extra inputs 

Our finding of the LCC structure associated with the control and the exponential growth 
of energy with the length of LCCs suggest a method to reduce the energy signihcantly. 
Since the key topological structure that determines the control energy is LCCs, one possible 
approach is to reduce the length of all the LCCs embedded in a network by making structural 
perturbations to the network. This, however, will inevitably modify the network structure, 
which may not always be practically viable. Is it possible to reduce the control energy 
without having to change the network structure? One intuitive method is to apply additional 
controllers beyond those calculated from the structural-controllability theory, which we name 
as redundant controllers. A straightforward solution is to add some redundant control signals 
along the LCCs. To gain insights, we consider a unidirectional ID chain and add a redundant 
control input at the fth node. As shown in Fig. [6](b), the magnitude of control energy is 
reduced dramatically. The optimal location to place the extra control should be near the 
middle of the chain so as to minimize the length of LCCs using a minimal number of 
redundant control signals. As can be seen from Fig. [6t^b), this simple strategy of adding one 
redundant control signal can reduce the required energy by nearly seven orders of magnitude! 
More specihcally, the redundant control signal to node i breaks a chain of length L into two 
shorter subchains: one of length i — \ and another of length L — i + 1. Roughly, the 
control energy is the sum of energies required to control the two shorter components, which 
is dominated by energy associated with the longer component owing to the exponential 
dependence of the energy on the chain length. By choosing i around L/2, the length of the 
longer part is minimized. For the circuit network in Fig. O the redundant control input 
can be realized by inducing external current input into a capacitor. As shown in Fig. [5](b), 
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FIG. 7: Augmented control inputs and energy optimization, (a) Densities of the original driver 
nodes no (purple) and of the augmented controls — hd (pink), (b) Normalized energy reduction 
Ai?*^id/F* = {E* — EE^)/E* (blue) when an additional control signal is added to the middle of each LCC 
[strategy (I)], (c) Normalized energy reduction AE*^^/E* = [E* — E*^^)/E* (green) when an additional 
control signal is added to the end node of each LCC [strategy (II)]. For the blue bars (or green bars), the 
optimized control energy (or F*end) is several orders of magnitude smaller than E-k. The bars with 

more gray portion the blue (or green) potions are for the networks with relatively low values of Ek and Dq 
(see Table A2 in Appendix F), for which energy optimization is not necessary. The bar corresponding to 
“Florida” (the most right) is striped due to the fact that this network is not controllable (i.e., with 
divergent energy) even if M* augmented control inputs are added. 


a reduction in energy of nearly 10 orders of magnitude is achieved. Applying a single 
redundant control input can thus be an extremely efficient strategy to reduce the required 
control energy for the one-dimensional chain network. 

Due to the fact that there can be multiple LCCs converge at the same end node, applying 
a control signal to each of the m nodes that all LCCs converge into is another strategy that 
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reduces the total number of redundant controls, while also signihcantly shrinks the control 
energy. (Detailed demonstrations of the enhancement strategies for physical or modeled 
networks and an implementation example on a circuit system are presented in Appendix 
G.) 


C. Control of real-world networks 


Can real-world complex networks be actually controlled? In Ref. [3^, the structural 


controllability of a large number of real-world networks were investigated, with the conclusion 
that optimal control of most of the networks can be achieved with only a few control signals. 
We investigate control energies of the same set of real-world networks (see Table Al in 
Appendix F for network details) and find that, when optimal control is applied according 
to maximum matching, most of the networks require realistically high energies. In fact, 
15 out of the 18 networks are practically uncontrollable. The main reason lies in the large 
LCCs of most of these networks. Another factor is that there are subgraphs that are not 
connected with each other and/or a large number of topological motifs such as loops, self¬ 
loops, or bidirectional edges. More strikingly, even with unlimited energy supply, the number 
of driver nodes as determined by the maximum matching algorithm from the structural 
controllability theory is generally insufficient to fully control the whole system, where there 
exists a number M* of nodes that never converge to their target states. These observations 
lead to the speculation that, in order to fully control a realistic network, more driver nodes 
are needed than those identified by the structural controllability theory. That is, more 
independent control signals are needed than those determined by maximum matching to 
drive all nodes in the network to their target states. The M* uncontrollable nodes are thus 
the required augmented set of driver nodes, each with an external control input. In total, 
A”/) = Njj + M* driver nodes need to be deployed to gain full control of the system [see 
Fig. [TKa) for riu and = N^/N for the 18 real-world networks]. Applying control signals 
to the nodes as determined by maximum matching and to the augmented driver nodes, we 
find that 17 out 18 real-world networks become practically controllable (see Table A2 in 
Appendix F). 


We also test the enhancement strategies using the 18 real-world networks, with the result 
that their practical controllability can be markedly enhanced (especially for those with large 
control diameters), as shown Figs. [3(b) and (c). We see that, for each of the real-world 
networks with unrealistically large energy requirement (see Table A2 in Appendix F), the 
optimized control energy (or E*^^) is several orders of magnitude smaller than the 

value of the original energy E* (the control energy with M* augmented driver nodes but 
without any redundant control input). This indicates the effectiveness of our optimization 
strategies. (In fact, strategy (I) works better than (II) in most cases.) For the networks 
with small control diameters, even without applying any enhancement strategy the control 
energies required are already much smaller than those for the other networks. For these 
networks energy optimization is practically unnecessary (see also Table A2 in Appendix F). 


We also hnd that increasing the control time tf can reduce the control energy so as to 
enhance the network’s practical controllability. 
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V. CONCLUSIONS AND DISCUSSIONS 


As stated in Ref. [3^, the ultimate proof that one understands a complex network com¬ 
pletely lies in one’s ability to control it. We discover a paradox arising from controlling 
complex networks with respect to control energy and the number of external input signals. 
To resolve the paradox, we focus on the situation where the structural-controllability theory 
yields a minimum number of external input signals required for full control of the network, 
and determine whether in these situations the control energy is affordable so as to real¬ 
ize actual control. Our systematic computations and analysis reveal a rather unexpected 
phenomenon: due to the singular nature of the control Gramian matrix, in the parameter 
regimes where optimal structural controllability is achieved in the sense that the number of 
driver nodes is minimized, energy consumption can be unbearably large. To obtain a more 
systematic understanding, we identify the fundamental structures in a network under the 
action of control signals, the longest control chains (LCCs), and argue that they essentially 
determine the control energy. We articulate and validate that the required energy increases 
exponentially with the length of the LCCs. In situations where the required number of 
controllers is few as determined by the structural controllability theory, the length of LCCs 
tends to be long, leading to practically divergent control energy. Another hnding is that, 
for minimum input signals, the required energy exhibits a robust algebraic scaling behav¬ 
ior, which can be explained by analyzable models constructed based on interacting LCCs. 
The discovery of the LCCs associated with controlling complex networks leads naturally to 
a simple method to resolve the paradox: increasing the number of controllers by placing 
extra control signals (beyond the number determined by the structural-controllability the¬ 
ory) along the LCCs. Indeed, test of a large number of real-world networks shows that, 
while they are structurally controllable j^, most of them exhibit enormous energy con¬ 
sumption. They can actually be controlled by placing more drivers than determined by the 
structural-controllability theory at proper locations along the LCCs. 

Our work indicates that the difficulty of achieving actual control of complex networks 
associated with even linear dynamics is beyond the current knowledge in the field of network 
control. Although the controllability theory offers a theoretically justihed framework to guide 
us to apply external inputs on a minimum set of driver nodes, when we implement control 
to steer a system to a desired state, the energy consumption is likely to be too large to be 
affordable. This finding suggests that, to achieve control of a complex networked system, the 
existing controllability framework merely offers a necessary rather than a practically feasible 
condition to assure actual control. We thus demand a more comprehensive and practically 
useful theoretical framework for addressing the extremely important issue of controlling 
complex networks. However, it is difficult to develop such a framework at the present and 
we do not even know if a mathematically justihed theory is available based on the current 
knowledge. Another issue is that for general networked nonlinear systems, we continue to 
lack the necessary condition based on the present controllability framework, as well as an 
understanding of required control energy. So far, we still know too little about controlling 
complex networked systems, and further effort is needed to address this challenging but 
greatly important problem shared by a wide range of helds. 
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APPENDIX A: ANALYTICAL CALCULATION OF THE IN- AND OUT-DEGREE 
DISTRIBUTIONS 

The in- and out-degree distributions of a directed complex network under connection bias prob¬ 
ability A = Pb can be obtained analytically. 

Defining ks and /cl to be the numbers of nodes in the neighborhood of a node with degree k, 
whose degrees are smaller or larger than than k, respectively, we have 




u rk r<h. fk 

— k'■ Ck'-^ dk'= — k'^-^dk', (Al) 


and 


where 


Therefore, 


^7 7-fcmax rihh-'y 

ki^ = kY P{k'\k) = ^ / k’^-^ dk' -r = Ak^-\ 


(fc)(7 - 2) 


A = 


C 


(A;)(7-2)' 

ks = k - /cl- 


(A2) 

(A3) 

(A4) 


We then have 

= Afcs + (1 - A)A:l = A(A; - k^) + (1 - A)A:l = (1 - 2A)A:l + \k = {1 - 2X)Ak^-^ + Xk, (A5) 

and 

Mn = (1 - X)ks + AA:l = (1 - X)k + (2A - 
The quantities P{kont) and P{k\a) can be derived from 

P(feout) dfcout = P{k) dk and P(/cin) dh^ = P{k) dk, 

which yield 


dk dk 

P{kont) = P{k)-r, - and P{h^) = P{k)- 


dfeout dfein 

Setting k = fi^{kovLt) and k = we can obtain the distributions. 

Using 

kout = fi{k) and hr, = f 2 {k), 


(A6) 

(A7) 


we obtain 


and 


P(fcout) - P{k) - P{fi H^out)) , 


1 


P{hr,)=P{fh\hr,))-^, 


(AS) 


(A9) 


m2 (kin)) 

In general, it is difficult to obtain an explicit expression. However, for some specific values of A or 
7 , analytical results are available. 


Case I: A = 0.5. 
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In this case, we have 


and 


Thus 


^out — ^in — 2 ’ 


k — 2/cQ^t, — ‘Ih'iYi- 


P{kont) = 2P{2kout) = 2C(2A:out)“^ = 2^-^Ck-X 
Akin to P{kout), we have 

Pihr,) = P{kont) = 2^-'^Ckr;^. 

Case II: A = 0. 


(AlO) 

(All) 


= Ak^-'^ ^k = 


kout ® 

~T ' 


We then have 


(A12) 


U/l \ ^ f kont\^ 'I' 1 C . 3 

Case III: A = 1. 


which yields 


k,^ = Ak^-'r ^ k = 


P(fcin) = P{kout) = :^A^k-£p 
3-7 


Case IV: 7 = 3. For example, for BA model, we have 

^out = (1 “ 2A)A + Xk, 


and 


Pikont) = C 


k = 


kont + (2A — 1)A 


^out + (2A — 1)A\ 1 


A 


A 


= CA^ 


-1 


A 


^out+(2A—1) 


_C 

W) 


-7 


= CA^ 


-1 


(A13) 

(A14) 

(A15) 

(A16) 

n -7 


^out+(2A—1)A:„ 


Similarly, we have 

and 


/cin = (2A - 1)A + (1 - \)k, 
Mn T (1 ~ 2A)A 


k = 


1-A 


so 


Pihn) = C 


kin + (1 “ 2A)A 

r^^A 


-7 


1 


1-A 


= C(1 - \y-^[kin + (1 - 2A)fe„,in]-T 


(A17) 

(A18) 

(A19) 
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Figure A1 | Condition number Cw versus control error ex for random and scale-free 
networks. Network size is iV = 100 for (a-d) and 200 for (e-h), average degree is (fc) = 6 for ER random 
networks [(a),(b),(e), and (f)] and 8 for BA scale-free networks [(c), (d), (g), and (h)]. Directional link 
probability between any pair of nodes is Pb =0.1. Panels (a),(c),(e), and (g) show the scaling relation 
between the condition number Cw and control error ex- Panels (b), (d), (f), and (h) show the fraction 
Rcw of the networks with a certain Cw number. The scaling relation holds within some Cw-^x region 
with boundaries specified as the black dashed lines. The ex values are not physically meaningful outside 
the boundaries that are defined according to the precision limit of computation. The thresholds of Cw and 
ex used in the computations are 10^^ and 10“^, respectively, which are indicated as the blue dashed lines. 
The threshold values are chosen to lie within the physical boundaries so that the calculations for all Cw 
values are meaningful. 


APPENDIX B: ADDITIONAL NUMERICAL RESULTS 

Bl: Condition number and control error. The correlation between the condition number 
Cw and the control error ex is shown in Fig. lAll We observe that, within a certain range of Cw, 
an approximate scaling relation exists between Cw and ex, shown in panels (a), (c), (e), and (g). 
However, the scaling disappears outside the shown Cw range. The reason is that, outside the Cw 
range, the Gramian matrix W is ill conditioned, leading to considerable errors when computing 
the matrix inverse. In principle, the scaling regime can be extended with improved computational 
precision, but not indefinitely. 

B2: More on structural and practical controllability measures. Figure |A2] shows the mea¬ 
sure of the structural controllability, ud, and the measure of the practical controllability, P{Cw), 
versus Pb for ER random and BA scale-free networks of size N = 200. We see that the structural 
and practical controllability cannot be simultaneously optimized irrespective of the network size. 

B3: More on control energy power law distribution. Figure IA3I shows the robustness of 
the control energy power-law distribution against varying network size for both random [(a)] and 
scale-free [(b)] topology. 
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Figure A2 | Practical controllability measures for directed networks. Measure of the practical 
controllability, P{Cw), for (a) ER random networks and (b) BA scale-free networks of size N = 200. 



Figure A3 | Distributions of control energy for practically controllable networks under 
different network sizes. Panels (a) and (b) are for random networks with (k) = 6 and scale-free 
networks with (k) = 8, respectively, for two values of the network size {N = 100 and N = 200), where we 
set Cw = 10^^ and t{ = 1. In all cases, we observe an algebraic (power-law) scaling behavior. 
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APPENDIX C: CONTROL ENERGY OF ONE-DIMENSIONAL STRING 


As shown in Fig. IA41 the energy required to control a unidirectional ID string nearly overlaps 
with that of a bidirectional one with identical weights. In fact, if the chains are not too long, the 
relative difference in the energy between the two case are within the same order of magnitude. Here 
we provide an analytical calculation of the control energy for a bidirectional ID chain network. 

The energy E is given by 

E{ti) = Xq • ■ xo, (A20) 

where H = ■ W ■ xq is the initial state of the network, and W is the Gramian 

matrix. Since H is positive definite and symmetric, its inverse can be decomposed in terms 
of its eigenvectors as = QAQ^, where Q = [qi,q 2 , ■ ■ ■ ,Qn] is composed of the orthonormal 
eigenvectors that satisfy QQ^ = Q^Q = I, and A = diag{Ai, A 2 , ■ ■ ■ , Aw} is the diagonal eigenvalue 
matrix of in descending order. Numerically, we find that Ai is typically much larger than 
other eigenvalues. We thus have 

n 

E{tf) = x^QAQ^xo = ^ Xiiqfxof « Ai(gfxo)^. (A21) 

i=l 

Since xq can be chosen arbitrarily, we set xq = [1, 0,..., 0]^, so Eq. (IA21D becomes 

E{ti) « Xiiqfqi) = Ai. (A22) 

For an undirected network, the adjacency matrix A is positive definite and symmetric. We 
can decompose A into the form A = VSV'^, where the columns of V constitute the orthonormal 
eigenvectors of A and S = diagjsi, S 2 , ■ ■ ■, sw} is the diagonal eigenvalue matrix of A in descending 
order. We thus have H = = Ve~^^^V'^WV . Let 


Ah = diag{Awi, Xh2: ■ ■ ■ ■, ^Hn} = diagll/Aw, l/Aw-i, • • •, l/^ij 


be the eigenvalue matrix of H in descending order. The energy can thus be expressed as 

E{tf) ^ Ai(gfxo)2 = A;^^(gfxo)^. 

Letting Ay/ = diag{Awi) Aw 2 ) • • • j-^Wjv} be the eigenvalue matrix of W in descending order. 
We can approximate the eigenvalue of H by W, which has been numerically validated: Ah ~ Ayr. 
We thus have 

e « Ai(gfxo)2 = A;^^(gfxo)2 A^ygfxo)^. (A23) 

Since orthonormal transform does not alter the eigenvalues of a given matrix, we have Ah = 

For an undirected chain, the adjacency matrix is 


A 


0 1 
1 0 

1 ■•. 1 
0 1 
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control matrix is -B = [1, 0,... , 0]^, and eigenvalues and eigenvectors of A are 


Si = 2 cos 


TT 


A^ + 1 


i] , i = 


= 


■ sin 


iV + 1 VA^ + l 


TT 


ij , = 1, ■ ■ ■ 


(A24) 

(A25) 


Recall that H = Ve BB'^Ve^^ dt)e . Substituting this in Eqs. (IA24D and 

(lA25p . after some algebraic manipulation, we obtain 


H = -^—VWPWV^, 
iV + 1 


(A26) 


where 


W = 


sin(0) 


sin(20) 


sin(A^0) 


J NxN 


and 


r2tt 

Jo 


Pjk — I e 
>0 


[cos{j6)+cos{k8)]t 


with 0 = Trj{N + 1), j,k = 1, - ■ ■ , N. 

As a result, we have S = diag{2cos(-jy^), 2cos(-^^^),..., 2cos{^^)}. The minimum eigenvalue 
of H is given by 

Ntt \i 


The Rayleigh-Ritz theorem can be used to bound P as: 

y^Py 


^Pn — T — ’ 

r y 


(A27) 


(A28) 


where y = [yi,y 2 , ■ ■ ■ ,yN]"^ is an arbitrary nonzero column vector, Ap^ and Ap^ are the maximal 
and minimal eigenvalues of P, respectively. Letting T = 2tf, we have 


y'^Py = (yi • • • yN) 


f g-[cos(i0)+cos(fce)]r 

Jo 




NxN 

\yN/ 


N j 

Y. y^y>^ 

Jo 


g-[cos(j6»)+cos(fce)]r 

j,k=l 
N 

j,k=l 

N N 

i=i i=i 


(A29) 


with {f,g) = Jq fgdr. 
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Figure A4 | Comparison between energies required to control a unidirectional and a 
bidirectional ID chains: (a) energies required to control a unidirectional chain Funi (red) and a 
bidirectional one Fbi (blue) versus chain length L, and (b) the relative energy difference \Enni — Ehi\/Euni 
versus chain length L. 


Letting bj = e and performing a Taylor expansion of 6*- around t 


0, we obtain 


N-l 




6- = [- cos(ie)l‘^ + I- 


k=0 


m 


(A30) 


with tj € [0,T]. Now letting 


N-l 


[-cos(i0)]^ —, 


k=0 


we have bj = qj{t) + [— cos{j9)]^ ■ /N\). Consequently, the numerator in the Rayleigh quotient 
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can be expressed as 
N 


N 


j=l j=l 

N / 


^-N' 


N 


^N' 


(Yl 1cos(i0)]^^ 1 ’ 1 + yj^~ cos(j0)]^^ ]) 


i=i 

N 


N 


N 


i=i 


m 


N 




--(Y yjyji*)^ Y yi[-cos(j0)]^^) 


i=i 


i=i 


i=i 


i=i 


iV 




.N 


+ (Z] %[-cos(i0)]^^, ^ %[-cos(i0)]^^) 


i=i 


i=i 


N 


N 




N 


< 


iY yjyj(^)^Y 


i=i 


i=i 


k=l 


m‘ 


N 


CY 


Denote as A'l 
'W \ 2 -/V 


Denote as i('2 


+ (w) E \y^y>^\^- 

j,k=l 


(A31) 


Denote asK^ 

Since y = [yi,y 2 , ■ ■ ■, is an arbitrary nonzero column vector, for each N and T, we can choose 
y = ym insofar as Ki and K 2 are relatively small compared with K'^. We can normalize y'^y to 
arrive at 


X ^ vLPym 

^Pn ^ 


vLym 


rpN \ 2 

) E ~ o 

■ ^ i,fc=i 


'j^2N 


(A32) 


where Apj^ is the smallest eigenvalue of P. Recall that P is symmetric and positive definite, 
using Cholesky decompostion we can obtain its factorization M| as P = LL^, where L is the 
lower triangular matrix with its diagonal filled with square roots of eigenvalues of P. Therefore, 
Eq. ()A26h can be written as PI = j^^VWLL"^WV'^. Since orthonormal transform does not change 
the eigenvalues of a matrix, H has the same eigenvalues as R = LL'^W = L{WV)^. 

Suppose Ap = diag{Ap^, Apj,..., Ap^} is the diagonal eigenvalue matrix of P in descending order. 
The jth eigenvalue of R satisfies 


A/?, = 


-Ap.(sinA:0)^ < -Ytt^Pp 


A^ + 1 “ A^ + 1 

where j and k run from 1 to n. The control energy P(tf) can then be approximated as 

{NlP' 


E{R)^X-hI^O({N + 1) 


+2N 

Cf 


(A33) 


APPENDIX D: CORRELATION BETWEEN NETWORK CONTROL ENERGY 
AND SMALLEST EIGENVALUE OF R-MATRIX 

Strong correlation between the average network control energy, {E), and the smallest eigenvalue 
of the R-matrix, A^j^, for ER random and BA scale-free networks can be observed in Eig. IA51 
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Figure A5 | Correlation between network control energy and the smallest eigenvalue of 
if-matrix. Network size is iV = 100, directional link probability between any pair of nodes is Pb = 0.1, 
and average degree is (a) (k) = 6 for ER random networks and (b) (k) = 8 for BA sale-free networks. 


indicating that the network control energy is essentially determined by the smallest eigenvalue of 
its //-matrix. 
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APPENDIX E: LCC-SKELETON AND DOUBLE-CHAIN INTERACTION MOD¬ 
ELS 


El: LCC-skeleton model. Referring to Fig. 4 in the main text, we assume that the control 
chains are independent of each other so that E 2 is negligible as compared with Ei. Each control 
chain is effectively a ID string. Due to the exponential increase in energy via chain length incre¬ 
ment, El can be regarded as the sum of control energies associated with the set of unidirectional 
ID strings, to which the contribution of the LCC dominates. The required energy to control the 
full network can thus be approximated as that required to control all LCCs, 


E = El -|- £"2 ~ El ~ m • El 


m 


A 


-1 

Hl' 


(A34) 


where El denotes the energy required to control an LCC, Xhj^ is the smallest eigenvalue of the 
LCC’s H matrix Hl, and m denotes the degeneracy (multiplicity) of the LCC, as shown in Fig. 4(b) 
in the main text. Results presented in Fig. 3(b) of the main text demonstrate a positive correlation 
between E and m- El, reinforcing the idea the independent LCCs are the key topological structure 
dictating the energy required to control the whole network. In particular, if a network contains 
long LCCs (as can be determined straightforwardly by maximum matching from the structural 
controllability theory 3^), there is high likelihood that it cannot be practically controlled as 
practically the required energy would diverge. 

Reasoning from an alternative standpoint, an arbitrary combination of Dq and m effectively 
represents a network, as shown in Fig. 4(b) in the main text, and the entire network ensemble 
can be represented by the ensemble of all possible combinations of LCCs. In the LCC ensemble, 
the quantities Dq and m emerge according to their probability density functions, Pdq{Dc) and 
Pm{m), respectively, and the appearance of an arbitrary pair of Dq and m is determined by their 
joint probability density function P{DQ,m). Consequently, the distribution of the energy required 
to control the original network can be characterized accurately by the distribution of the energy 
required to control the LCC skeleton in the corresponding ensemble. 

Figure lAGl fai shows the distribution of the control diameter Dq, essentially the length distribu¬ 
tion of LCCs. The probability density function decays approximately exponentially with Dq, so 
we write 


-Pdc(-C’c) = a - e 


-b-Dc 


(A35) 


where a and b are positive constants. Using the relationship between El and Dq [e.g., Fig. 3(a) 
in the main text], we have 

El-A-^Dq^ ^In^, (A36) 

where A and B are positive constants. The probability density function of El can then obtained 


as 


Pl{El) = PDoi^ln^: 


d(iln^) 

dEi 


— Ab ■ Er ® . 

B ^ 


(A37) 


In the ER random network ensemble, the probability density of LCC degeneracy m for networks 
with Dq > 2 also exhibits an exponential decay, as shown in Fig. IA6l bL 


Pm{m) = c • e 


(A38) 


where c and d are positive constants. 
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Figure A6 | Probability distribution of control diameter and its degeneracy, (a) Distribution of 
Dc (blue) and the probability that an LCC of length Dq appears in the random network ensemble versus 
Dc- (b) Probability distribution of LCC-degeneracy m for networks with Dc > 2. 


Since the control energy depends monotonously on the control diameter Dq, the energy depen¬ 
dence on m can be revealed by examining the correlation between Dq and m, which can in general 
be either positive or negative. From Eq. (|A38p . we see that Pm{m) increases exponentially with 
m, implying a positive correlation: 

Dc = si + S2- m, (A39) 


where si and S 2 are positive constants. This form of relation ensures that Pm{m) has the form in 
Eq. ()A38|1 . Positive correlation, however, means that the number of LCCs increases with its length, 
which is unphysical for random networks. These arguments suggest that a contradiction can arise 
if we assume either positive or negative correlation between Dq and m. A natural resolution is 
that these two quantities are independent of each other. Since El, the energy required to control 
a chain of length L, is determined mainly by the control diameter Dq, El and m can be assumed 
to be independent of each other so that their joint probability density function can be expressed 
as P{EL,m) K. Pl{El) ■ Pm{m). 

Having obtained Pl{El) and Pm{m), we can calculate the cumulative probability distribution 
function of the estimated control energy E = m ■ El required to control the original network. We 
have 

^oo 

Fe{E) = P{m ■ El < E) = [ P(Ei, m) • dm] dE^ (A40) 

Jo Jo 

oo — 


where r(^) and r{-^,gE) are the Gamma and incomplete Gamma function, respectively. Thus, 
the probability density function of E can then be expressed as 


Pe{E) = 


dFE{E) 

dE 


caAe 


g 9^ ^ ^ 


{gE)-d+i^, (A41) 
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where the first term —e~^^/E can be neglected due to the typically large value of E. Since 
we observe numerically that the difference between the two Gamma functions is approximately 
constant: /i(r) = r(-^) — V{^^gE) k. 1.7, we can simplify Eq. (jA4ip as 

Pe{E) K, C ■ (A42) 

where C = • hr] is a positive constant. Equation (jA42p indicates a power-law 

distribution of the control energy, providing an analytical explanation to the numerically discovered 
energy distribution for practically controllable networks, as exemplified in Fig. 3 in the main text. 
To get a rough idea about the value of the power-law scaling exponent, say we take ~ 2 and 6 ~ 1 
(typical numerical values). A theoretical estimate of the power-law exponent is thus 1 + b/B ~ 1.5, 
which is consistent with the value obtained from results from direct numerical simulation. The fact 
that the distribution of E^ is power law with the identical exponent provides additional support for 
our assumption that the LCC degeneracy m plays little role in determining the control energy. It 
is the combination of the exponential decay in the probability distribution of the control diameter 
[cf., Eq. (IA35I) ] and the exponential increase in the energy required to control LCC with its length 
[cf., Eq. ()A36p ] that gives rise to the power-law energy distribution of the LCCs, which ultimately 
leads to the power-law distribution in the actual energy required to control the original random 
network. 

We see that the control diameter of a network is a key quantity determining the required control 
energy. The topological diameter, on the other hand, is a fundamental quantity characterizing, for 
example, the small-world structure of the network [ij. An interesting issue concerns the relation 
between the control and topological diameters. In particular, if the network has a large diameter, 
does it mean that its control diameter must be large as well? This issue has been addressed, with 
the Ending that there is little correlation between the two types of diameters. 

E2: Double-chain interaction model. Our analysis of the LCC-skeleton model predicts 
power-law distribution of the required energy for practically controllable networks, which agrees 
qualitatively with numerics. However, in the model interactions among the coexisting chains are 
ignored. In a physical system, interactions among the basic components usually plays an im¬ 
portant role in determining the system’s properties. To obtain a more accurate estimate of the 
behaviors of the control energy, we need to include the interactions among the chains. The neces¬ 
sity is further justified as there are discrepancies between the actual control energy and that from 
the LCC-skeleton model, as exemplified in Fig. 3(b) in the main text. In particular, there is an 
approximately continuous distribution in the energy required to control the actual network, but 
the distribution of the energy from the LCC-skeleton model tends to aggregate into a number of 
subintervals, each corresponding to a certain value of the control diameter associated with an LCC. 
Thus, in order to reproduce the numerically obtained energy distributions, we must incorporate the 
interactions among the LCCs into the model. However, including the interactions makes analysis 
difficult, as there are typically a large number of interacting pairs of chains. To gain insight into the 
role played by the interactions, it is useful to focus on the relatively simple case of two interacting 
chains. 

Our double-chain interaction model is constructed, as follows. Consider two identical unidirec¬ 
tional chains, denoted by Cl and (72, each of length Dq- Every node in Cl connects with every 
node in (72 with probability p, all links between the two chains are unidirectional. A link points 
to (72 from Cl with probability pi ^2 and the probability for a link in the opposite direction is 











3 Double-LCC interaction: 



b LCC-non-LCC interaction: 




Figure A7 | Distribution of control energy in a double-chain interaction models, (a) Two 

chains of Dq = 5 (red) and Dq = 6 (black), where the upper panel is a schematic illustration of two LCCs 
of identical length Dq = 5 interacting with each other via some random links (green) between them, (b) 
Two chains with their lengths randomly chosen from 3 — 6, where the upper panel shows the case of two 
interacting chains of length 5 and 3. The longer chain (red) plays the role of LCC, while the shorter chain 
is a non-LCC (orange). 


P 2 ^i = 1 —pi^ 2 - By changing the connection rate p and the directional bias pi^. 2 , we can simulate 
and characterize various interaction patterns between the two chains. To be concrete, we generate 
an ensemble of 10000 interacting double-chain networks, each with 2 ■ Dq nodes and multiple ran¬ 
domized interchain links as determined by the parameters p and pi^ 2 - As shown in Fig. IA7l a). 
the distribution of the control energy displays a remarkable similarity to that for random networks, 
in that a power-law scaling behavior emerges with the exponent about 1.5. A striking result is 
that the energy distributions from the double-chain interaction model are much more smooth than 
those from the LCC-skeleton model, indicating the key role played by the interchain interactions 
in spreading out the control energies that are clustered when the interactions are absent. The 
power-law distribution holds robustly with respect to variations in the parameters p and pi^ 2 - In 
addition, to reveal the role of the interaction between an LCC and a non-LCC chain in the control 
energy, we randomly pick their lengths from [3, 6] with equal probability, where the longer chain 
acts as an LCC. Again, we observe a strong similarity between the energy distributions from ran¬ 
dom networks and from this model, as shown in Fig. IA7l b) , suggesting a universal pattern followed 
by pair interactions, regardless of the length of the chains. In particular, interactions between two 
chains, LCC or not, have similar effect on the control-energy distribution. These results indicate 
that the double-chain interaction model captures the essential physical ingredients of the energy 
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TABLE Al: Description of the 18 real-world networks used in the paper (A - number of nodes; M 
number of edges). 


Type 

Index 

Name 

N 

M 

Description 

Trust 

1 

College Student [M, ^6] 

32 

96 

Social network 


2 

Prison Inmate [M, ^6] 

67 

182 

Social network 


3 

s208a 

122 

189 

Logic circuit 

Circuits 

4 

s420a 

252 

399 

Logic circuit 


5 

s838a 

512 

189 

Logic circuit 

Citation 

6 

Small World [Ij 

233 

1988 

Stanley Milgram 

7 

Kohonen [58] 

3772 

96 

T. Kohonen 


8 

Protein-1 

95 

213 

Protein network 

Protein 

9 

Protein-2 

53 

123 

Protein network 


10 

Protein-3 

99 

212 

Protein network 


11 

St. Martin 

45 

224 

Food Web 


12 

Seagrass [6^ 

49 

226 

Food Web 


13 

Grassland 

88 

137 

Food Web 

Food Web 

14 

Ythan [61] 

135 

601 

Food Web 


15 

Silwood [62] 

154 

370 

Food Web 


16 

Little Rock [63] 

183 

2494 

Food Web 


17 

Baydry [6^ 

128 

2137 

Food Web 


18 

Florida [^ 

128 

2106 

Food Web 


distribution in controlling complex networks. 

APPENDIX F: THE PRACTICAL CONTROLLABILITY OF REAL-WORLD 
NETWORKS 

Table lAll lists the names and types of the real-world networks studied and Table IA2I presents 
more detailed information about the controllability of the 18 real-world networks analyzed in the 
main text. 
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TABLE A2: Practical controllability of real-world networks studied in Ref. [Sg, where Ad denotes the 
number of controllers as determined by the structural controllability theory and M* is the number of 
augmented driver nodes needed to make the network practically controllable. The densities of the original 
driver nodes and with M* augmented drivers included are riD = Ad/A and = A^ / A, respectively, 

where measure of controllability, and E* denotes the new energy. When an additional 

control signal is added to the middle of each LCC [strategy (I)] so that extra control inputs are used, 
the control energy is EE^. We also test another strategy in which an extra control signal is applied to 
each of the m convergent nodes of all LCCs [strategy (II)], in which additional control inputs are 
used. The control energy required is denoted as E*^^. The control diameter Dq of each network is listed in 
the last column. (See Table A1 in Appendix F for detailed description of the 18 networks.) 


Type 

Name 

N 

No 

M* 

ran 


E* 

^mid 

■^mid 

^:nd 

^e*nd 

Dc 

Trust 

Coll. Student 

32 

6 

0 

0.19 

0.19 

O 

o 

r— 1 

X 

3 

2.5 

X 

10® 

4 

7.3 

X 

lO'^ 

4 


Prison Inmate 

67 

9 

27 

0.13 

0.54 

2.3 X 10^ 

1 

4.6 

X 

10® 

1 

9.7 

X 

10^ 

5 

Electronic Circuits 

s208a 

122 

29 

40 

0.24 

0.57 

2.3 X 10^ 

1 

3.0 

X 

10® 

1 

4.0 

X 

10® 

5 


s420a 

252 

59 

71 

0.23 

0.52 

2.6 X 10® 

11 

2.1 

X 

10® 

10 

2.2 

X 

10® 

4 


s838a 

512 

119 

81 

0.23 

0.39 

4.1 X 10® 

13 

3.1 

X 

10® 

6 

3.7 

X 

10® 

5 

Citation 

Small World 

233 

140 

11 

0.60 

0.65 

2.0 X 10^ 

1 

1.9 

X 

10® 

1 

1.9 

X 

10® 

5 


Kohonen 

3772 2114 413 

0.56 

0.67 

9.4 X 10"^ 

49 

4.0 

X 

10^ 

37 

5.1 

X 

10^ 

3 

Protein 

Protein-1 

95 

48 

19 

0.51 

0.71 

1.9 X 10^® 

7 

8.5 

X 

10^ 

5 

2.2 

X 

10® 

3 


Protein-2 

53 

13 

0 

0.25 

0.25 

3.7 X 10® 

2 

9.0 

X 

10® 

2 

1.2 

X 

10® 

4 


Protein-3 

99 

22 

35 

0.22 

0.58 

3.9 X 10® 

3 

5.1 

X 

lO'^ 

3 

5.3 

X 

10^ 

4 

Food Web 

St. Martin 

45 

14 

9 

0.31 

0.51 

2.7 X 10® 

2 

9.2 

X 

10^ 

2 

1.5 

X 

10® 

3 


Seagrass 

49 

13 

7 

0.27 

0.41 

3.6 X 10® 

3 

8.3 

X 

10^ 

2 

3.4 

X 

10® 

3 


Grassland 

88 

46 

0 

0.52 

0.52 

3.4 X 10® 

1 

8.6 

X 

lO'^ 

1 

9.2 

X 

10^ 

4 


Ythan 

135 

69 

14 

0.51 

0.62 

2.6 X 10® 

4 

1.9 

X 

10® 

2 

2.0 

X 

10® 

3 


Silwood 

154 

116 

12 

0.75 

0.83 

1.5 X 10"^ 

3 

1.0 

X 

10® 

2 

9.9 

X 

102 

3 


Little Rock 

183 

99 

36 

0.54 

0.74 

5.4 X 10® 

48 

3.6 

X 

10® 

48 

3.6 

X 

10® 

2 


Baydry 

128 

62 

34 

0.48 

0.75 

1.7 X 10® 

32 

9.6 

X 

10^ 

48 

9.6 

X 

102 

2 


Florida 

128 

30 

69 

0.23 

0.77 

NaN 

1 


NaN 

1 


NaN 

3 


31 






Figure A8 | Effects of redundant control inputs, (a) For control diameter Dc = 2, distribution of 
the energy ratio E^/E under optimization strategies (I) (Mid, red circles) and (II) (End, green squares), 
where E^ and E are the required energies with and without redundant control, respectively. Results from 
two randomized optimization strategies are marked by R-Mid (black triangles) and R-End (gray 
diamonds), corresponding to strategies (I) and (II), respectively. The values of E^/E are collected from 
practically controllable networks from an ensemble of 10000 ER random networks ((fc) = 6, Pb = 0.1). For 
each network, if strategy (I) [or (II)] requires r redundant controls, r additional random control inputs are 
applied to the system 10 times to average out the random fluctuations. Panels (b-d) show the E^/E 
distributions for networks with control diameter Dq = 3, 4, or 5, respectively. 


APPENDIX G: ENERGY OPTIMIZATION OF MODELED COMPLEX NET¬ 
WORKS AND A CASCADE PARALLEL R-C CIRCUIT NETWORK 

Gl: Optimization strategy for modeled networks. A realistic complex network can often 
have multiple LCCs, requiring multiple redundant control inputs. Say we wish to introduce a 
small number of extra control signals. Due to the m degeneracy in the end nodes LCCs, it seems 
that the number of redundant control inputs should exceed m if every LCC receives one such 
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signal. However, since even a unity deduction in the LCC length can significantly lower the control 
energy, a simpler strategy is to place one redundant control input at each of the m end-nodes 
to which all possible LCCs converge. In this case, each LCC in the network is broken into a 
chain of length L — 1 and a single node, and consequently, the control energy is now determined 
by one-dimensional chains of length L — 1 instead of length L. Figure lASl al shows the effects 
of two optimization strategies to introduce redundant control signals on the energy distribution: 
applying one redundant control signal (I) at the middle and (II) at the end of each and every LCC, 
respectively. For comparison, for each strategy, the same number of redundant control inputs 
are also applied randomly throughout the network. The ratio between the control energy under 
optimization strategy. Ex, and the original control energy E characterizes the effectiveness of the 
optimization strategies. In particular, if the distribution of Ex/E is concentrated on small values 
of Ex/E, then the corresponding optimization strategy can be deemed to be effective. As shown 
in Fig. IA8l a). both optimization strategies outperform the random strategies, with strategy (I) 
performing slightly better than (II). The networks requiring proper optimization to be practically 
controlled are typically those with long control diameters. Figures [A8l b-dl show that this is indeed 
the case. 

G2: An example of controlling and optimizing a circuit system. We consider a cascade 
parallel R-C circuit consisting of three identical resistors and capacitors as an example to illustrate 
how the circuit can be abstracted into a directed network, as shown in Fig. IA9[ For convenience, 
we set Ri = R 2 = R 3 = R and Ci = C 2 = C 3 = C, and denote the currents through Ri, R 2 , and 
i ?3 as ii{t) , i 2 {t), and i 3 {t), respectively. The equations of the circuit are 

u{t) = ii{t)R + ui{t) 

Ul{t) = i2{t)R + U2{t) 

U 2 {t) = i 3 {t)R + U 3 {t) 

= m - k(t) 
c- 


(A43) 


dt 

. = ^3(t) 


dt 


After some algebraic manipulation, we have 

+ 7^^2(i) + -^u{t) 

^jU3{t) 




which can be written as 

dtii (t) 




(A44) 


Ul{t) \ 

Mt) 

\ Mt) / 



(A45) 


Setting R = in and C = IE, we have 

/ dui{t) 
dt 

du2 (t) 
dt 

du3{t) 
\ dt 

where 


= A- 


ui(t) \ 
U2(t) 

V Mi) / 


A = 


/ -2 1 0 

1 -2 1 

^ 0 1-1 


-I- B ■ u(t). 


(A46) 


(A47) 
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Figure A9 | Controlling and optimizing a cascade parallel R-C circuit and the corresponding 
network presentation, (a) A cascade parallel R-C circuit with 3 resistors (i?i, R 2 , and i? 3 , each of 
resistance 117) and 3 capacitors (Ci, C 2 , and C 3 , each of capacitance IF), where u{t) is the external input 
voltage, ui(t), U 2 (t), and U 3 {t) are the voltages on the capacitors Ci, C 2 , and C 3 , respectively, i 2 (t), 
and 13 ( 1 ) are the currents through the resistors Ri, R 2 , and R 3 , respectively, (b) Network representation of 
the circuit in (a), (c) Circuit with an extra external current input ie(t) into the capacitor C 2 - (d) The 
extra external current input ie{t) serves as a redundant control input injected into node 2 of the network 
in (b). There are two driver nodes (yellow) in the network: 1 and 3. 


is the adjacency matrix of the network representing the circuit, and 


B = 


(l\ 

0 


VO/ 


(A48) 


is the control input matrix. The circuit has then been transferred into a 3-node bidirectional ID 
chain network with adjacency matrix A. 

Without loss of generality, we inject an extra external current input ie{t) into the capacitor C2, 


34 











































and the circuit equations become: 


The state equations are 


where 


u{t) = ii{t)R + ui{t) 
uiit) = i2{t)R + U2{t) 

U2{t) = i3{t)R + U3{t) 

^ = i2{t) - hit) + hit) 


c 


d^3 (t) 


dt 


= hit) 


/ dni(t) 
dt 

du2 (t) 
dt 

dtt3 (t) 

dt 


= A- 


Ul 


it)] 


/ 

U 2 it) I +Be 
\U 3 it) ) 




(A49) 


(A50) 


(A51) 


is the control input matrix of the circuit under the original control input u(t) on node 1 and a 
redundant control input hit) to node 2. Similarly, the redundant control input can be injected 
into any capacitor. 

It is necessary to keep all other nodes unaffected while introducing exactly one extra control 
input into the circuit. However, any additional voltage change in any part of the circuit can lead 
to voltage changes on all the capacitors. A change in the current through a capacitor will not 
affect the currents in other components of the network, since only the time derivative of its voltage 
is affected. Thus, a meaningful way to introduce an extra control signal input to one node of a 
circuit’s network is to inject current into one particular capacitor in the circuit. 
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